Independent Study Report: Propose Partial Replication Schemes for Replicated Declustering and Compare Their Performance

نویسنده

  • Ali Tekeoǧlu
چکیده

In this course, under the supervision of our professor, we have focused on the techniques for declustering data into multiple disks. Our aim is to get familiar with all the related work done till today so we have started from the very beginning of declustering which is done on relational databases and Cartesian product files [4, 5]. We have explored the literature and find out the most outstanding, milestone quality papers about declustering, read them and have small discussions each weak on one or two papers. This way we tried to understand the increased interest and need on declustering of data to multiple disks, which was parallel to the increase in the amount of data used in today’s high performance, high complexity computing environments. Those environments utilize multiple disks controlled with multiple processors compared to single disk and single processor machines used at the early times of declustering research. Today, algorithms and approaches have adjusted to multiple disk systems that generally store spatial data and frequently receive range queries. Today’s database systems manipulate terabytes of data related to different fields of sciences like cartography, epidemiology, transportation and Geographical Information Systems [10]. We have observed several approaches aimed at decreasing the amount of time required for retrieval of data under different types of queries. Most of the queries fall into one of the groups; range query, arbitrary query and connected query. Those queries are investigated and several algorithms proposed for each of them separately in the literature. Our initial reference point for partial replication schemes was the research paper of our professor which analyses and compares all of the replicated declustering schemes proposed till now. We examined the comparisons of different methods for replicated declustering under different query types and loads.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Selective Replicated Declustering for Arbitrary Queries

Data declustering is used to minimize query response times in data intensive applications. In this technique, query retrieval process is parallelized by distributing the data among several disks and it is useful in applications such as geographic information systems that access huge amounts of data. Declustering with replication is an extension of declustering with possible data replicas in the...

متن کامل

Threshold-based declustering

Declustering techniques reduce query response time through parallel I/O by distributing data among multiple devices. Except for a few cases it is not possible to find declustering schemes that are optimal for all spatial range queries. As a result of this, most of the research on declustering has focused on finding schemes with low worst case additive error. However, additive error based scheme...

متن کامل

Global Mobility Management by Replicated Databases in Personal Communication Networks

This paper explores the use of replicated databases for management of customer data (e.g., mobility data, call routing logic) in global, intelligent and wireless networks. We propose and analyze two, full and partial, data replication schemes which are compatible with industry protocol standards and compare them with the traditional, centralized database scheme. By identifying a set of key tele...

متن کامل

- 1 - Global Mobility Management by Replicated Databases in Personal Communication Networks

This paper explores the use of replicated databases for management of customer data (e.g., mobility data, call routing logic) in global, intelligent and wireless networks. We propose and analyze two, full and partial, data replication schemes which are compatible with industry protocol standards and compare them with the traditional, centralized database scheme. By identifying a set of key tele...

متن کامل

cient Disk Allocation for Fast Similarity Searching

As databases increasingly integrate non-textual information it is becoming necessary to support eecient similarity searching in addition to range searching. Recently, declustering techniques have been proposed for improving the performance of similarity searches through parallel I/O. In this paper, we propose a new scheme which provides good declus-tering for similarity searching. In particular...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007